Name | Version | Summary | Date |
dataprobe | 1.0.0 | Advanced data pipeline debugging and profiling tools for Python | 2025-07-28 18:26:25 |
desu | 0.1.5 | Lightweight utilities for data science, data engineering, and data analysis projects. | 2025-07-28 00:11:44 |
rushdb | 1.10.0 | RushDB Python SDK | 2025-07-27 12:13:16 |
sql-testing-library | 0.15.0 | A powerful Python framework for unit testing SQL queries across BigQuery, Snowflake, Redshift, Athena, Trino, and DuckDB with mock data | 2025-07-27 00:50:28 |
schemaworks | 1.2.2 | A schema conversion toolkit for JSON, Spark, PyIceberg and SQL formats. | 2025-07-22 19:43:13 |
jsonstat-validator | 0.2.2 | A Python validator for the JSON-stat 2.0 standard format, based on Pydantic. | 2025-07-20 03:04:24 |
dagster-postgres-pandas | 0.2.3 | PostgreSQL I/O manager for Dagster with Pandas DataFrame support | 2025-07-19 07:25:07 |
lakehouse-engine | 1.26.0 | A configuration-driven Spark framework serving as the engine for several lakehouse algorithms and data flows. | 2025-07-15 10:23:48 |
dataghost | 0.1.1 | Time-Travel Debugger for Data Pipelines | 2025-07-09 03:00:50 |
snowpark-checkpoints | 0.2.1 | Snowflake Snowpark Checkpoints | 2025-04-07 13:40:41 |
airflow-parse-bench | 1.0.1 | Easily measure and compare your Airflow DAGs' parse time. | 2025-01-26 03:39:23 |
glue-utils | 0.9.1 | Reusable utilities for working with Glue PySpark jobs | 2024-11-14 10:57:20 |
extralo | 0.17.4 | ETL for Python | 2024-11-11 19:33:06 |
nested-data-helper | 1.0.2 | Help you find the data nested deep in your data | 2024-10-25 15:14:02 |
pydeequ | 1.3.0 | PyDeequ - Unit Tests for Data | 2024-04-26 20:35:24 |
compars | 0.0.0 | DataFrame comparison done right (AKA the Bear-agnostic DataFrame comparison library) | 2024-04-20 18:28:36 |
grizzlys | 0.0.1 | Python DataFrames powered by Julia | 2024-04-14 19:18:11 |